-
Notifications
You must be signed in to change notification settings - Fork 116
Add voice interactions with Gemini Live and ros-mcp-server to Gemini example. #115
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add voice interactions with Gemini Live and ros-mcp-server to Gemini example. #115
Conversation
Thank you for your contribution, @tracelarue. I will give it a try soon. @rjohn-v — I’d suggest adding a client/ folder in the repository to store installation packages and runnable clients. This would help keep different client implementations (e.g., Gemini API client) and their installation steps organized in one place. I’m not sure I’d keep this under examples/. |
I connected issue robotmcp/robot-mcp-client#20 to this PR. |
@tracelarue @rjohn-v Another good next step would be to provide a |
@stex2005 Thank you for the review and feedback. I'll work on getting these changes and fixes implemented. |
@tracelarue wow, voice command sounds super cool. thanks for your contributions. I cannot wait for trying this feature soon! |
@stex2005 Ready for review Changes made:
|
@tracelarue thank you for the contribution. I will try it :) |
@tracelarue thanks again for your contribution. I've tried your example on WSL and Ubuntu and it still gives some compatibility issues due to hardware/driver changes on camera/microphone. I would suggest making this example a simplified version of the I am working on this simplified example and I will soon share it. |
@tracelarue Thank you so much again for this contribution. We are debating whether client implementations should be in this or another repository. I would keep this PR on hold until we decide. Meantime, I would suggest working on the |
@stex2005 Thanks for the review. I’ll start working on a text-only version and test it on WSL for compatibility. Could you share your hardware setup and specify where the issue occurred? Was the gemini_client able to launch and connect to the MCP? I’ve previously run the live version on a Raspberry Pi, where I had to adjust the mic and speaker sampling rates for hardware compatibility. |
This is the first attempt: https://github.com/stex2005/ros-mcp-client |
We are temporarily closing this PR. Will be implemented in |
Gemini Live for low-latency bidirectional voice interactions with ros-mcp-server.